659 research outputs found

    Using Machine Learning to Identify Patterns in Learner-Submitted Code for the Purpose of Assessment

    Get PDF
    Programming has become an important skill in today’s world and is taught widely both in traditional and online settings. Instructors need to grade increasing amounts of student work. Unit testing can contribute to the automation of the grading process but it cannot assess the structure or partial correctness of code, which is needed for finely differentiated grading. This paper builds on previous research that investigated machine learning models for determining the correctness of programs from token-based features of source code and found that some such models can be successful in classifying source code with respect to whether it passes unit tests. This paper makes two further contributions. First, these results are scrutinized under conditions of varying similarity between code instances used for model training and testing, for a better understanding of how well the models generalize. It was found that the models do not generalize outside of groups of code instances performing very similar tasks (corresponding to similar coding assignments). Second, selected binary classification models are used as a base for multi-class prediction with two different methods. Both of these exhibit prediction success well above the random baseline, with potential to contribute to automated assessment with multi-valued measures of quality (grading schemes), in contrast to the binary pass/fail measure associated with unit testing

    Pacloud: Towards a Universal Cloud-based Linux Package Manager

    Full text link
    Package managers are a very important part of Linux distributions but we have noticed two weaknesses in them: They use pre-built packages that are not optimised for specific hardware and often they are too heavy for a specific need, or packagesmay require plenty of time and resources to be compiled. In this paper, we present a novel Linux package manager which uses cloud computing features to compile and distribute Linux packages without impacting the end user's performance. We also show how Portage, Gentoo's package manager can be optimised for customisation and performance, along with the cloud computing features to compile Linux packages more efficiently. All of this resulting in a new cloud-based Linux package manager that is built for better computing performance.Comment: Presented at CECCC 2019. 8 pages, 13 figure

    Clustering Weblogs on the Basis of a Topic Detection Method

    Get PDF
    In recent years we have seen a vast increase in the volume of information published on weblog sites and also the creation of new web technologies where people discuss actual events. The need for automatic tools to organize this massive amount of information is clear, but the particular characteristics of weblogs such as shortness and overlapping vocabulary make this task difficult. In this work, we present a novel methodology to cluster weblog posts according to the topics discussed therein. This methodology is based on a generative probabilistic model in conjunction with a Self-Term Expansion methodology. We present our results which demonstrate a considerable improvement over the baseline

    Prototype/topic based Clustering Method for Weblogs

    Full text link
    [EN] In the last 10 years, the information generated on weblog sites has increased exponentially, resulting in a clear need for intelligent approaches to analyse and organise this massive amount of information. In this work, we present a methodology to cluster weblog posts according to the topics discussed therein, which we derive by text analysis. We have called the methodology Prototype/Topic Based Clustering, an approach which is based on a generative probabilistic model in conjunction with a Self-Term Expansion methodology. The usage of the Self-Term Expansion methodology is to improve the representation of the data and the generative probabilistic model is employed to identify relevant topics discussed in the weblogs. We have modified the generative probabilistic model in order to exploit predefined initialisations of the model and have performed our experiments in narrow and wide domain subsets. The results of our approach have demonstrated a considerable improvement over the pre-defined baseline and alternative state of the art approaches, achieving an improvement of up to 20% in many cases. The experiments were performed on both narrow and wide domain datasets, with the latter showing better improvement. However in both cases, our results outperformed the baseline and state of the art algorithms.The work of the third author was carried out in the framework of the WIQ-EI IRSES project (Grant No. 269180) within the FP7 Marie Curie, the DIANA APPLICATIONS Finding Hidden Knowledge in Texts: Applications (TIN2012-38603-C02-01) project and the VLC/CAMPUS Microcluster on Multimodal Interaction in Intelligent Systems.Perez-Tellez, F.; Cardiff, J.; Rosso, P.; Pinto Avendaño, DE. (2016). Prototype/topic based Clustering Method for Weblogs. Intelligent Data Analysis. 20(1):47-65. https://doi.org/10.3233/IDA-150793S476520

    Weblog and short text feature extraction and impact on categorisation

    Full text link
    The characterisation and categorisation of weblogs and other short texts has become an important research theme in the areas of topic/trend detection, and pattern recognition, amongst others. The value of analysing and characterising short text is to understand and identify the features that can identify and distinguish them, thereby improving input to the classification process. In this research work, we analyse a large number of text features and establish which combinations are useful to discriminate between the different genres of short text. Having identified the most promising features, we then confirm our findings by performing the categorisation task using three approaches: the Gaussian and SVM classifiers and the K-means clustering algorithm. Several hundred combinations of features were analysed in order to identify the best combinations and the results confirmed the observations made. The novel aspect of our work is the detection of the best combination of individual metrics which are identified as potential features to be used for the categorisation process.The research work of the third author is partially funded by the WIQ-EI (IRSES grant n. 269180) and DIANA APPLICATIONS (TIN2012-38603-C02-01), and done in the framework of the VLC/Campus Microcluster on Multimodal Interaction in Intelligent Systems.Perez Tellez, F.; Cardiff, J.; Rosso, P.; Pinto Avendaño, DE. (2014). Weblog and short text feature extraction and impact on categorisation. Journal of Intelligent and Fuzzy Systems. 27(5):2529-2544. https://doi.org/10.3233/IFS-141227S2529254427

    Selenium and impaired physical function in US and Spanish older adults

    Get PDF
    Background: Selenium (Se) is a trace element with a narrow safety margin. Objectives: To evaluate the cross-sectional and longitudinal dose-response association between Se exposure and measures of impaired physical function and disability in older adults. Design: NHANES 2011–2014 cross-sectional (US, n = 1733, age ≥60 years) and Seniors-ENRICA-2 2017–2019 cross-sectional and longitudinal (Spain, n = 2548 and 1741, respectively, age ≥65 years) data were analyzed. Whole blood and serum Se levels were measured using inductively coupled plasma-mass spectrometry. Lowerextremity performance was assessed with the Short Physical Performance Battery, and muscle weakness with a dynamometer. Incident mobility and agility limitations, and disability in instrumental activities of daily living (IADL) were ascertained with standardized questionnaires. Analyses were adjusted for relevant confounders, including physical activity. Results across studies were pooled using random-effects meta-analysis. Results: Meta-analyzed odds ratios (95% confidence interval) per log2 increase in whole blood Se were 0.54 (0.32; 0.76) for weakness, 0.59 (0.34; 0.83) for impaired lower-extremity performance, 0.48 (0.31; 0.68) for mobility limitations, 0.71 (0.45; 0.97) for agility limitations, and 0.34 (0.12; 0.56) for disability in at least one IADL. Analyses for serum Se in NHANES showed similar results. Findings suggest the inverse association with grip strength is progressive below 140 μg/L (p-value for non-linear trend in the Seniors-ENRICA-2 study = 0.13), and above 140 μg/L (p-value for non-linear trend in NHANES = 0.11). In the Seniors-ENRICA-2 cohort, with a 2.2 year follow-up period, a doubling in baseline Se levels were associated with a lower incidence of weakness [odds ratio (95% confidence interval): 0.45 (0.22; 0.91)], impaired lower-extremity performance [0.63 (0.32; 1.23)], mobility [0.43 (0.21; 0.91)] and agility [0.38 (0.18; 0.78)] limitations. Discussion: In US and Spanish older adults, Se concentrations were inversely associated with physical function limitations. Further studies are needed to elucidate underlying mechanisms.Instituto de Salud Carlos III European Commission PI18/287 16/609State Secretary of R + D + I PID2019-108973RB-C21/C22European Social Fund (ESF) European Commissio

    Dietary determinants of cadmium exposure in the Strong Heart Family Study

    Get PDF
    Urinary cadmium (Cd) concentrations in the Strong Heart Family Study (SHFS) participants are higher than in the general US population. This difference is unlikely to be related to tobacco smoking. We evaluated the association of consumption of processed meats and other dietary products with urinary Cd concentrations in the SHFS, a family-based study conducted in American Indian communities. We included 1725 participants with urine Cd concentrations (standardized to urine creatinine) and food frequency questionnaire data grouped in 24 categories, including processed meat. Median (IQR) urinary Cd concentrations were 0.42 (0.20–0.85) μg/g creatinine. The age, sex, smoking, education, center, body mass index, and total kcal adjusted geometric mean ratio (GMR) (95%CI) of urinary cadmium concentrations per IQR increase in each dietary category was 1.16 (1.04–1.29) for processed meat, 1.10 (1.00–1.21) for fries and chips, 0.87 (0.80–0.95) for dairy products, and 0.89 (0.82–0.97) for fruit juices. The results remained similar after further adjustment for the dietary categories associated with urinary Cd in the previous model except for fries and chips, which was no longer statistically significant. These findings revealed the potential importance of processed meat products as a dietary source of cadmium

    Ciencias de la Biología y Agronomía

    Get PDF
    Este volumen I contiene 17 capítulos arbitrados que se ocupan de estos asuntos en Tópicos Selectos de Ciencias de la Biología y Agronomía, elegidos de entre las contribuciones, reunimos algunos investigadores y estudiantes. Se presenta un Estudio Comparativo de los Recursos Hidrológico-Forestales de la Microcuenca de la Laguna de Epatlan, Pue. (1993 a 2014); la Situación Actual de la Mancha de Asfalto en Maíz (Zea mays L.) en los Municipios de Jiquipilas y Ocozocoautla, Chiapas, México; las poblaciones sobresalientes de maíz de la raza Zapalote Chico, en la Región Istmeña de Oaxaca; Se indica el índice de área foliar de cultivo de Chile Poblano mediante dos métodos en condiciones protegidas; Esquivel, Urzúa y Ramírez exploran el efecto de la biofertilización con Azospirillum en el crecimiento y producción de Jitomate; esbozan su artículo sobre la determinación del nivel de Heterosis en híbridos de Maíz para la Comarca Lagunera; una investigación sobre la estabilización de semilla de Solanum lycopersicum durante el almacenamiento y estimulación de la germinación; acotan sobre el CTAB como una nueva opción para la detección de Huanglongbing en cítricos, plantean su evaluación sobre el aluminio y cómo afecta la vida de florero de Heliconia psittacorum; indican sobre el impacto del H-564C, como un híbrido de maíz con alta calidad de proteina para el trópico húmedo de México; presetan su investigación sobre la producción de Piña Cayena Lisa y MD2 (Ananas comosus L.) en condiciones de Loma Bonita, en Oaxaca; acotan sobre el efecto de coberteras como control biológico por conservación contra áfidos en Nogal Pecanero; esbozan sobre la caracterización de cuatro genotipos de Frijol Negro en Martínez de la Torre, Veracruz, México; presentan una caracterización hidroecológica de la microcuenca de Arroyo Prieto, Yuriría, Gto., y alternativas para su restauración ambiental; presentan su investigación sobre el efecto del hongo Beauveria bassiana sobre solubilización de fosfatos y la disponibilidad de fósforo en el suelo; plantean su investigación sobre la Germinación y regeneración in vitro de Epidendrum falcatum LINDL; esbozan su artículo sobre genotipos de frijol negro y su tolerancia a sequía terminal en Veracruz, México

    Severe Asthma Standard-of-Care Background Medication Reduction With Benralizumab: ANDHI in Practice Substudy

    Get PDF
    Background: The phase IIIb, randomized, parallel-group, placebo-controlled ANDHI double-blind (DB) study extended understanding of the efficacy of benralizumab for patients with severe eosinophilic asthma. Patients from ANDHI DB could join the 56-week ANDHI in Practice (IP) single-arm, open-label extension substudy. Objective: Assess potential for standard-of-care background medication reductions while maintaining asthma control with benralizumab. Methods: Following ANDHI DB completion, eligible adults were enrolled in ANDHI IP. After an 8-week run-in with benralizumab, there were 5 visits to potentially reduce background asthma medications for patients achieving and maintaining protocol-defined asthma control with benralizumab. Main outcome measures for non-oral corticosteroid (OCS)-dependent patients were the proportions with at least 1 background medication reduction (ie, lower inhaled corticosteroid dose, background medication discontinuation) and the number of adapted Global Initiative for Asthma (GINA) step reductions at end of treatment (EOT). Main outcomes for OCS-dependent patients were reductions in daily OCS dosage and proportion achieving OCS dosage of 5 mg or lower at EOT. Results: For non-OCS-dependent patients, 53.3% (n = 208 of 390) achieved at least 1 background medication reduction, increasing to 72.6% (n = 130 of 179) for patients who maintained protocol-defined asthma control at EOT. A total of 41.9% (n = 163 of 389) achieved at least 1 adapted GINA step reduction, increasing to 61.8% (n = 110 of 178) for patients with protocol-defined EOT asthma control. At ANDHI IP baseline, OCS dosages were 5 mg or lower for 40.4% (n = 40 of 99) of OCS-dependent patients. Of OCS-dependent patients, 50.5% (n = 50 of 99) eliminated OCS and 74.7% (n = 74 of 99) achieved dosages of 5 mg or lower at EOT. Conclusions: These findings demonstrate benralizumab's ability to improve asthma control, thereby allowing background medication reduction
    corecore